Spatio-textual Indexing for Geographical Search on the Web

نویسندگان

  • Subodh Vaid
  • Christopher B. Jones
  • Hideo Joho
  • Mark Sanderson
چکیده

Many web documents refer to specific geographic localities and many people include geographic context in queries to web search engines. Standard web search engines treat the geographical terms in the same way as other terms. This can result in failure to find relevant documents that refer to the place of interest using alternative related names, such as those of included or nearby places. This can be overcome by associating text indexing with spatial indexing methods that exploit geo-tagging procedures to categorise documents with respect to geographic space. We describe three methods for spatio-textual indexing based on multiple spatially indexed text indexes, attaching spatial indexes to the document occurrences of a text index, and merging text index access results with results of access to a spatial index of documents. These schemes are compared experimentally with a conventional text index search engine, using a collection of geo-tagged web documents, and are shown to be able to compete in speed and storage performance with pure text indexing.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The SPIRIT Spatial Search Engine: Architecture, Ontologies and Spatial Indexing

The SPIRIT search engine provides a test bed for the development of web search technology that is specialised for access to geographical information. Major components include the user interface, geographical ontology, maintenance and retrieval functions for a test collection of web documents, textual and spatial indexes, relevance ranking and metadata extraction. Here we summarise the functiona...

متن کامل

Indexation sémantique et recherche d'information interactive

Among the various facets of Information Retrieval in textual data, the search for information located in space and time constitutes a full research field. Indeed, it requires, for indexing as for retrieval, specific linguistic analyses and resources. The present paper roots in the GéoSem project, whose aim is to develop advanced, semantic-based methods for geographical documents retrieval. Toda...

متن کامل

FAST: Frequency-Aware Spatio-Textual Indexing for In-Memory Continuous Filter Query Processing

The ubiquity of spatio-textual data comes from the popularity of GPS-enabled smart devices, e.g., smartphones. These devices provide a platform that supports a wide range of applications that generate and process spatio-textual data. These applications include social networks, micro-blogs, web-search for local attractions and events, and location-aware ad targeting. These applications need to p...

متن کامل

Demo Paper: A Spatio-Temporal-Textual Crime Search Engine

This paper proposes a STT(spatio-temporal-textual) search engine for extracting, indexing, querying and visualizing crime information. Until recently, it’s a labor-intensive work to identify crime entities, cluster similar suspect activities, and discover patterns from massive online collections. It’s a big challenge to reveal inherent ST(spatio-temporal) correlations among mass crime informati...

متن کامل

SKIF-P: a point-based indexing and ranking of web documents for spatial-keyword search

There is a significant commercial and research interest in location-based web search engines. Given a number of search keywords and one or more locations (geographical points) that a user is interested in, a location-based web search retrieves and ranks the most textually and spatially relevant web pages. In this type of search, both the spatial and textual information should be indexed. Curren...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005